Compositions isolated from plant cells and their use in the modification of plant cell signaling

ABSTRACT

Novel isolated polynucleotides that encode polypeptides involved in plant cell signaling are provided, together with genetic constructs comprising such polynucleotides. Methods for using such constructs for the modulation of cell signaling in plants are also disclosed, together with transgenic plants comprising such constructs.

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application is a continuation-in-part of co-pending U.S. Patent Application No. 09/704,302, filed Nov. 1, 2000, which is a continuation-in-part of U.S. Patent Application No. 09/228,986, filed Jan. 12, 1999, and claims the benefit of priority to U.S. Provisional Patent Application No. 60/162,866, filed Nov. 1, 1999, and PCT Application No. PCT/US00/00724, filed Jan. 11, 2000. The disclosures of the aforementioned applications are herein incorporated by reference in their entirety.

REFERENCE TO SEQUENCE LISTING SUBMITTED ON COMPACT DISC

[0002] This application incorporates by reference in its entirety the Sequence Listing that is provided in duplicate on compact discs that accompany the application. Each CD contains the following file: 1020C2, having a date of creation of Mar. 18, 2002 and a file size of 2.90 MB.

FIELD OF THE INVENTION

[0003] This invention relates to the field of modifying the responses of plant cells to external signals, such as environmental changes, and developmental cues. More specifically, this invention provides isolated polynucleotides encoding polypeptides that are integrally located in plant cell membranes and that mediate cellular signaling processes.

BACKGROUND OF THE INVENTION

[0004] Plants progress through set developmental programs throughout the course of their lifetimes. This is particularly evident in embryogenesis and floral development. There are a variety of signal molecules produced by certain cells in the plant to which other cells, particularly in the meristematic regions, are poised to respond. These signal molecules trigger distinct sets of developmental programs at specific times that lead to the formation of, for example, flowers or cotyledons. In addition to the programmed developmental pathways, plants are exposed to a variety of environmental stimuli such as changes in temperature and amount of sunlight, availability of water, wounding from mechanical injury and attack by pathogens. Environmental factors, such as exposure to light, heat, cold, drought, etc., activate the expression of genes and synthesis of proteins and other compounds essential for an appropriate response to the environmental signal and thereby, the healthy development of the plant. These responses, like the developmental pathways, are mediated by signal molecules (hereinafter referred to as ligands).

[0005] To respond to these ligands, plant cells produce surface receptor proteins that serve as sensors, regulators and/or transducers of cell signals. The intracellular transduction of a signal is often transmitted via a phosphorylation cascade of molecules that culminates in the transcription of genes to elicit the appropriate cellular response either for normal development or against environmental challenge.

[0006] One major class of receptor proteins is the single-transmembrane family, of which there are several subclasses. These proteins are characterized by three domains: an extracellular signal molecule recognition/binding domain, a single cell membrane-spanning domain and an intracellular signal transduction domain which is usually a protein kinase. Many, but not all, plant single transmembrane proteins belong to the subclass known as receptor-like kinases (RLKs). The intracellular kinase domains of plant RLKs are all serine/threonine protein kinases, while the extracellular domains of RLKs are of different types. One type of RLK is characterized by the presence of the extracellular S-domain, originally described in self-incompatibility-locus glycoproteins that inhibit self-pollination. The S-domain is recognized by an array of ten cysteine residues in combination with other conserved residues. Another class of RLKs has an extracellular domain distinguished by leucine rich repeats (LRR) that are involved in protein-protein interactions. Binding of ligands to the extracellular domain is followed by receptor dimerization, autophosphorylation and the activation of a series of intracellular proteins which serve to transduce the signal to the nucleus. The structure of plant RLKs is very similar to receptors found in cell signaling pathways in animal systems.

[0007] One example of a plant RLK is the Xa21gene, which confers resistance to the plant pathogen Xanthomonas oryzae pv. oryzae race 6. This gene was cloned using genetic means comparing Xanthomonas-sensitive and resistant strains of rice (Song et al., Science 270:1804-1806, 1995), and has been subsequently shown to confer resistance to Xanthomonas in Arabidopsis. The 1025 amino acid protein shows a number of features with similarity to known protein domains including a NH₂-terminal 23 amino acid residue signal peptide, indicating that the protein is directed to the plasma membrane. Amino acids 81 to 634 contain 23 imperfect copies of a 24-amino acid LRR. Amino acids 651 to 676 encode a 26-amino acid hydrophobic segment that is likely to form a membrane-spanning domain. The C-terminal amino acids contain a putative intracellular serine threonine kinase domain carrying 11 subdomains with all 15 invariant amino acids that are typical of protein kinases. Subdomains VI and VIII are indicative of serine-threonine phosphorylation specificity. Xa21has strong similarities to other RLKs, such as the Arabidopsis receptor-like kinase proteins RLK5 (HAESA) and TMK1, showing conservation of both the LRR and protein kinase domains. It is not yet known to what protein Xa21transduces its pathogen recognition signal.

[0008] Another family of membrane receptor molecules expressed by plant cells is histidine kinases (HKs). HKs have been known for some time in bacterial signal transduction systems, where they form one half of a two-component signaling system. The bacterial HK serves as a sensor molecule for extracellular signals, such as changes in osmoticum, nutrients and toxins. The HK autophosphorylates on a histidine residue in response to ligand binding. This phosphohistidine donates its phosphate group to an aspartate residue of the second member of the two component system, known as the response regulator (RR). The phosphorylated RR then goes on to further transduce the signal, by binding other proteins as regulatory subunits, thereby either activating or inactivating them, depending on the specific circumstance. Alternatively, the phosphorylated RR binds DNA in a sequence-specific manner, serving to directly activate specific genes which code for proteins that mediate the response to the extracellular stimulus. In certain cases, HKs have a composite structure. Specifically, these proteins contain RR domains at their carboxy termini. The phosphohistidine of the HK transfers its phosphoryl group to the active site aspartate residue of this RR domain. In these cases, since the RR domain is membrane-bound, the signal cannot be transduced directly by RR binding to DNA. Instead, histidine phosphotransfer (HPt) proteins serve to further transduce the signal. The phosphoaspartate of the composite HK/RR protein donates the phosphate group to an active site histidine in the HPt protein. The HPt phosphohistidine in turn donates the phosphate group to a true RR, which then modulates activities of other proteins or activates gene expression in response to the external signal.

[0009] Like bacteria, plant cells have several two-component signaling systems which consist of a sensor element HK and a RR. In addition, composite HK proteins with RR domains at their carboxy termini (hereinafter referred to as hybrid HK/RR proteins) are found in both bacteria and plants. The HK proteins are distinguished by well-conserved amino acid motifs that occur in a specific order. From the amino terminus, the conserved regions are identified as the H, N, G1, F and G2 boxes. These motifs are usually found within a 200-250 amino acid span of the protein. The G1, F and G2 boxes are thought to be involved in nucleotide binding. As in bacteria, upon receiving the extracellular ligand, the HK is autophosphorylated on the histidine residue contained in the H box. The phosphate group is subsequently transferred to the RR. Alternatively, some HKs constitutively autophosphorylate their histidine residues and this activity is suppressed by binding of the extracellular ligand. All HKs are believed to phosphorylate a RR, as an obligate part of signal transduction.

[0010] RRs are characterized by the absolute conservation of an aspartate which is phosphorylated by the phosphohistidine of the HK, and a conserved lysine residue. Unlike bacteria, RRs in plants have not been shown to bind DNA directly. Rather, all the plant RR's characterized to date appear to transduce the signal into protein kinase cascades, which eventually phosphorylate and either activate or inactivate transcription factors, and thereby gene expression. Similar to bacteria, plants also contain hybrid HK/RRs which contain a RR domain at the carboxy terminus of the protein. As might be expected based on this observation, plant. genomes have also been found to harbor histidine-containing phosphotransfer (HPt) domain genes. The HPt domain has been shown to play an important role in some His-Asp phosphorelay pathways. However, it has not yet been shown directly that any plant HPt protein interacts either with a hybrid HK/RR or with a soluble RR.

[0011] The ethylene receptor family (e.g., ETR1; Chang et al., Science 262:539-544, 1993) comprise the best known two-component signaling system in plants. Ethylene is a well-known ligand that is involved in the coordination of fertilization, senescence, skoto/photomorphogenesis, and responses to pathogens and mechanical injury. The ethylene signal is transduced through the protein CTR1, which is a Raf-like protein kinase. CTR1 is a negative regulator of downstream steps in the signaling pathway. While the details of this pathway remain unclear, it appears that the ethylene receptors are constitutively active in the absence of ethylene, thereby constantly phosphorylating CTR1, which in turn represses other genes in the ethylene response pathway. Binding of ethylene to the ethylene receptors inhibits the phosphlorylation function of the receptor, which results in the inhibition of the negative regulator CTR1, thereby allowing the activation of downstream proteins in the ethylene signal transduction cascade. This culminates in activation of ethylene response genes.

[0012] Two RR genes, IBC6 and IBC7, which are induced in response to the plant growth regulator cytokinin, have been cloned from Arabidopsis thaliana and characterized (Brandstatter and Kieber, Plant Cell 10:1009-1019, 1998). Cytokinin is known to regulate plant growth and development, including such physiological events as nutrient metabolism, expansion and senescence of leaves, and lateral branching. It is likely that IBC6 and IBC7 are involved in the transduction of the cytokinin signal in plants. Consistent with such a hypothesis, it has been demonstrated that ARR4 (IBC7) interacts directly with phytochrome B (phyB) (Sweere et al., Science 294:1108-1111). PhyB is one of a family of histidine kinase-like photoreceptor molecules (Thummler et al., FEBS Lett. 357:149-155, 1995; Kehoe and Grossman, Science 273:1409-1412, 1996) involved in photo/skotomorphogenesis and physiological responses to day length, such as initiation of flowering. The finding of IBC7's interaction with phyB is a link between cytokinin perception and cytokinin's physiological effects on prevention of light deprivation-mediated etiolation and senescence. Furthermore, the response regulator ARR1 has been shown to function as a transcription factor which directs the transcription of ARR6, another response regulator that is transcribed in direct response to cytokinin, like IBC6 and IBC7 (Sakai et al., Science 294:1519-1521, 2001). The proteins encoded by these RR genes are all possible signal transduction partners with the hybrid HK/RR known as CRE1/WOL. CRE1/WOL was recently shown to directly bind cytokinin and transduce a signal in transgenic yeast cells (Inoue et al., Nature 409:1060-1063, 2001; Suzuki et al., Plant Cell Physiol. 42:107-113, 2001). Subsequent studies have further shown that CRE1/WOL directly binds cytokinin in Arabidopsis thaliana (Yamada et al., Plant Cell Physiol. 42:1107-1123, 2001; Ueguchi et al., Plant Cell Physiol. 42:751-755, 2001). Furthermore, the gene encoding the hybrid HK/RR protein CKI1 causes cytokinin-like effects when it is ectopically expressed in transgenic plants (Kakimoto, Science 274:982-985, 1996). However, the role that the CKI1 protein plays in cytokinin signal transduction, if any, is still unclear. It is possible that any or all of the known plant response regulators may also interact with CKI1 to mediate other aspects of the cytokinin response. However, it is clear that a two-component HK/RR system is involved in cytokinin signal transduction.

[0013] While polynucleotides encoding proteins involved in plant cell signaling have been isolated for certain species of plants, genes encoding many such proteins have not yet been identified in a wide range of plant species. Thus, there remains a need in the art for materials which may be usefully employed in the modification of cell signaling in plants.

SUMMARY OF THE INVENTION

[0014] Briefly, the present invention provides polynucleotides isolated from eucalyptus and pine which encode polypeptides involved in cell signaling, together with methods for the use of such polynucleotides and polypeptides. Such polypeptides function as sensor-regulators or receptor kinases. The isolated polynucleotides and polypeptides may be usefully employed in the modification of plant cell responses either during the growth and development of a plant, or under conditions of stress resulting from pathogens or environmental factors.

[0015] In a first aspect, the present invention provides isolated and purified polynucleotides obtainable from eucalyptus and pine which encode RLKs, HKs, RRs, HPts or hybrid HK/RR proteins. In one embodiment, the isolated polynucleotides comprise a DNA sequence selected from the group consisting of: (a) sequences recited in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974; (b) complements of the sequences recited in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974; (c) reverse complements of the sequences recited in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974; (d) reverse sequences of the sequences recited in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974; and (e) sequences having either 75%, 90% or 95% identity, as defined herein, to a sequence of (a) -(d).

[0016] In a further aspect, isolated polypeptides encoded by an inventive polynucleotide are provided. In certain embodiments, such polypeptides comprise an amino acid sequence selected from the group consisting of SEQ ID NO: 68-130, 482-832, 889-945, 953-959 and 975-989; and sequences having at least 75%, 90% or 95% identity to a sequence of SEQ ID NO: 68-130, 482-832, 889-945, 953-959 and 975-989.

[0017] In another aspect, the invention provides genetic constructs comprising a polynucleotide of tie present invention, either alone, in combination with one or more other polynucleotides disclosed herein, or in combination with one or more known DNA sequences, together with transgenic cells comprising such constructs.

[0018] In a related aspect, the present invention provides genetic constructs comprising, in the 5′-3′ direction, a gene promoter sequence; an open reading frame coding for at least a functional portion of a polypeptide of the present invention; and a gene termination sequence. The open reading frame may be orientated in either a sense or antisense direction. Genetic constructs comprising an untranslated, or non-coding, region of a gene coding for an inventive polypeptide or a nucleotide sequence complementary to a non-coding region, together with a gene promoter sequence and a gene termination sequence, are also provided. Preferably, the gene promoter and termination sequences are functional in a host plant. Most preferably, the gene promoter and termination sequences are those of the original genes but others generally used in the art, such as the Cauliflower Mosaic Virus (CaMV) promoter, with or without enhancers such as the Kozak sequence or Omega enhancer, and Agrobacterium tumefaciens nopaline synthase terminator may be usefully employed in the present invention. Tissue-specific promoters may be employed in order to target expression to one or more desired tissues. The genetic construct may further include a marker for the identification of transformed cells.

[0019] In a further aspect, transgenic cells, preferably plant cells, comprising the genetic constructs of the present invention are provided, together with organisms, preferably plants, comprising such transgenic cells, and fruit and seeds and other products, derivatives, or progeny of such plants. Propagules of such transgenic plants are also encompassed in the present invention. As used herein, the word “propagule” means any part of a plant that may be used in reproduction or propagation, sexual or asexual, including cuttings

[0020] In yet another aspect, methods for modifying cell signaling in a target organism, such as a plant, are provided, such methods including stably incorporating into the genome of the plant a genetic construct of the present invention. In a preferred embodiment, the target plant is a woody plant, preferably selected from the group consisting of eucalyptus and pine species, most preferably from the group consisting of Eucalyptus grandis and Pinus radiata. In a related aspect, a method for producing a target organism, such as a plant, having modified cell signaling is provided, the method comprising transforming a plant cell with a genetic construct of the present invention to provide a transgenic cell and cultivating the transgenic cell under conditions conducive to regeneration and mature plant growth.

[0021] In yet a further aspect, the present invention provides methods for modifying the activity of a polypeptide in a target organism, such as a plant, comprising stably incorporating a genetic construct of the present invention into the genome of the plant. In a preferred embodiment, the target plant is a woody plant, preferably selected from the group consisting of eucalyptus and pine species, most preferably from the group consisting of Eucalyptus grandis and Pinus radiata.

[0022] The above-mentioned and additional features of the present invention and the manner of obtaining them will become apparent, and the invention will be best understood by reference to the drawings and the following more detailed description. All references disclosed herein are hereby incorporated by reference in their entirety as if each was incorporated individually.

BRIEF DESCRIPTION OF THE DRAWINGS

[0023]FIG. 1A shows the amino acid sequence of SEQ ID NO: 76, with the signal sequence and transmembrane domain being underlined. In the extracellular domain, twelve leucine-rich repeats (boxed) are flanked by two pairs of conservatively spaced cysteines (in bold). In the leucine-rich repeat region, a conserved sequence characteristic of the kinase family has been identified. The alignment of the leucine-rich repeats with this conserved element is shown in FIG. 1B. The intracellular domain is characterized by the eleven conserved protein kinase domains (Hanks and Quinn, Methods Enzymol. 200:38-62, 1991) marked with Roman numerals. The fifteen residues invariant in protein kinases and the conserved cysteine residue are in bold.

[0024]FIG. 1B shows the alignment of leucine-rich repeats in the extracellular domain of SEQ ID NO: 76. The numbers to the right of the alignment indicate the number of leucine-rich repeats. A six-amino acid signature inserted in LRR5 is characteristic of this family of receptor-like kinases. The consensus sequence shown below the alignment is commonly found in other plant proteins containing LRRs (Li and Chory, Cell 90:929-938, 1997).

[0025]FIG. 2 illustrates the amino acid sequence of SEQ ID NO: 114, showing a signal peptide and three transmembrane domains. Other conserved boxes characteristic of hybrid HK/RR proteins were also identified (boxed): an H-box, N-box, G1-box, F-box, G2-box, DD-box, D-box and KP-box (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0026]FIG. 3 illustrates the amino acid sequence of SEQ ID NO: 906, showing the following conserved boxes characteristic of hybrid HK/RR proteins (boxed): an H-box, N-box, G1-box, F-box, G2-box, DD-box, D-box and KP-box (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0027]FIG. 4 illustrates the amino acid sequence of SEQ ID NO: 908, showing the Hpt domain (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0028]FIG. 5 illustrates the amino acid sequence of SEQ ID NO: 909, showing the following conserved boxes characteristic of hybrid HK/RR proteins (boxed): an H-box, N-box, G1-box, F-box, G2-box, DD-box, D-box and KP-box (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0029]FIG. 6 illustrates the amino acid sequence of SEQ ID NO: 975, showing the following conserved boxes characteristic of RRs (boxed): an DD-box, D-box and KP-box (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0030]FIG. 7 illustrates the amino acid sequence of SEQ ID NO: 956, showing a signal peptide and two transmembrane domains. The following conserved boxes characteristic of HK/RR proteins (boxed) were identified: an H-box, N-box, G1-box, F-box, G2-box, DD-box, D-box and KP-box (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0031]FIG. 8 illustrates the amino acid sequence of SEQ ID NO: 957, showing two transmembrane domains. The following conserved boxes characteristic of hybrid HK/RR proteins (boxed) were identified: an H-box, N-box, G1-box, F-box, G2-box, DD-box, D-box and KP-box (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0032]FIG. 9 illustrates the amino acid sequence of SEQ ID NO: 976, showing two transmembrane domains. The following conserved boxes characteristic of hybrid HK/RR proteins (boxed) were identified: an H-box, N-box, G1-box, F-box, G2-box, DD-box, D-box and KP-box (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0033]FIG. 10A illustrates the nucleotide sequence of SEQ ID NO: 961, encoding a histidine kinase. In a splice variant of SEQ ID NO: 961 (SEQ ID NO: 962; shown in FIG. 11A), the nucleotides in bold and underlined are spliced out to produce a shorter variant of the protein. The amino acid sequence encoded by SEQ ID NO: 961 (SEQ ID NO: 977 ) is given in FIG. 10B, and the amino acid sequence of the splice variant (SEQ ID NO: 978) is given in FIG. 11B.

[0034]FIG. 10B illustrates the amino acid sequence of SEQ ID NO: 977, showing two transmembrane domains. The following conserved boxes characteristic of hybrid HK/RR proteins (boxed) were identified: an H-box, N-box, G1-box, F-box, G2-box, DD-box, D-box and KP-box (Grebe and Stock, Adv. Microb. PhysioL. 41:139-227, 1999).

[0035]FIG. 11A illustrates the nucleotide sequence of SEQ ID NO: 962, encoding a splice variant of the histidine kinase given in SEQ ID NO: 961 (FIG. 10A). The amino acid sequence encoded by the splice variant (SEQ ID NO: 978) is given in FIG. 11B.

[0036]FIG. 11B illustrates the amino acid sequence of SEQ ID NO: 978, which is a splice variant of SEQ ID NO: 977. Two transmembrane domains (underlined) and a conserved boxe H-box (boxed) (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999) are present in this variant.

[0037]FIG. 12 illustrates the amino acid sequence of SEQ ID NO: 979, showing two transmembrane domains. The following conserved boxes characteristic of hybrid HK/RR proteins (boxed) were identified: an H-box, N-box, G1-box, F-box, G2-box, DD-box, D-box and KP-box (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0038]FIG. 13 illustrates the amino acid sequence of SEQ ID NO: 980, showing the following conserved boxes characteristic of RRs (boxed): a DD-box, D-box and KP-box (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0039]FIG. 14 illustrates the amino acid sequence of SEQ ID NO: 981, showing the following conserved boxes characteristic of histidine kinases (boxed): an H-box, N-box and G1-box (Grebe and Stock, Adv. Microb. PhysioL 41:139-227, 1999).

[0040]FIG. 15 illustrates the amino acid sequence of SEQ ID NO: 982, showing a leucine zipper (underlined).

[0041]FIG. 16 illustrates the amino acid sequence of SEQ ID NO: 854, showing a conserved Hpt domain (boxed) (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0042]FIG. 17 illustrates the amino acid sequence of SEQ ID NO: 983, showing a conserved Hpt domain (boxed) (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0043]FIG. 18 illustrates the amino acid sequence of SEQ ID NO: 984, showing a conserved Hpt domain (boxed) (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0044]FIG. 19 illustrates the amino acid sequence of SEQ ID NO: 985, showing a conserved Hpt domain (boxed) (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0045]FIG. 20 illustrates the amino acid sequence of SEQ ID NO: 986, showing a conserved Hpt domain (boxed) (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0046]FIG. 21 illustrates the amino acid sequence of SEQ ID NO: 987, showing a conserved Hpt domain (boxed) (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0047]FIG. 22 illustrates the amino acid sequence of SEQ ID NO: 988, showing a conserved Hpt domain (boxed) (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

[0048]FIG. 23 illustrates the amino acid sequence of SEQ ID NO: 989, showing a conserved Hpt domain (boxed) (Grebe and Stock, Adv. Microb. Physiol. 41:139-227, 1999).

DETAILED DESCRIPTION

[0049] The present invention provides isolated and purified polynucleotides that encode polypeptides involved in plant cell signaling. As discussed above, cell signaling is known to play a critical role in the growth and development of plants, and in cellular responses to external stimuli, such as environmental factors and pathogens. Transformation of plants with polynucleotides that encode polypeptides involved in cell signaling may thus be employed to modify properties such as cell proliferation, differentiation, elongation and survival; resistance to disease; and nutrient metabolism.

[0050] As discussed above, histidine kinases (HKs) play important roles in signal transduction systems in microbial organisms and eukaryotes (e.g. plants and yeast), and have well-established uses due to their enzymatic activity—namely the autophosphorylation of an H box histidine residue and donation of the phosphoryl group to a specific aspartate residue in a response regulator. For example, the hybrid HK/RR ETRI is known to be involved in ethylene signal transduction. Modulation of ETRI expression will thus lead to a modification of physiological properties regulated by ethylene, such as fruit ripening, and senescence of leaves and flowers. Modulation of the expression of this protein in transgenic plants may therefore be employed to prolong the useful life of cut flowers by delaying senescence. Additionally, modulation of the expression of ETRI may be used to selectively enhance the senescence of reproductive organs, resulting in engineered sterile plants. Over-expression of the ethylene receptor ligand-binding domain can be used to engineer an ethylene-insensitive phenotype.

[0051] Osmosensor histidine kinases participate in signal transduction in bacteria, yeast and plants. For example, genetic constructs containing domains of osmosensor polypeptides encoded by the inventive polynucleotides can be introduced into plants and expressed to create plants that are tolerant to drought or saline.

[0052] The hybrid HK/RR protein CRE1 is involved in cytokinin signal transduction. Cytokinin has been shown to play critical roles in lateral branching, leaf expansion, cell division, nutrient distribution and delaying senescence, among other physiological and developmental phenomena. Therefore, modulation of the expression of CRE1 may result in, for example, the delay of senescence in selected cell types or organs. This would result in prolonged shelf life for fruits and vegetables between harvest and consumption. Alternatively, modulation of CRE1 expression may be used to decrease branching frequency in forest tree species, resulting in long stretches of valuable knot-free clearwood for use in solid timber furniture and veneers.

[0053] Using the methods and materials of the present invention, the amount of a specific plant cell polypeptide may be modulated by incorporating additional copies of genes encoding the polypeptide into the genome of a target organism, such as a plant. Similarly, an increase or decrease in the amount of the polypeptide may be obtained by transforming the target organism with antisense copies or RNAi expression constructs of such genes. Ligand binding domains of, for example, histidine kinases may be used as “dominant negatives” to bind ligand without transducing signal, thereby preventing or reducing signal transduction. Alternatively, the kinase domain alone may be used as a “dominant positive” to transduce signal in the absence of ligand.

[0054] In one embodiment, the present invention provides isolated polynucleotides encoding, or partially encoding, plant polypeptides that are involved in cell signaling, the polynucleotides being derived from eucalyptus and pine. Specifically, the present invention provides isolated polynucleotides encoding RLKs from Eucalyptus grandis (SEQ ID NO: 2, 8, 9, 11, 15, 18, 19, 21-25, 33, 34, 38, 131-301, 448-463, 848, 858-874, 882-887, 946 and 947) and Pinus radiata (SEQ ID NO: 1, 3-7, 10, 12-14, 16, 17, 20, 26-32, 35-37, 39-41, 302-447, 833-847, 875-881 and 888), and isolated polynucleotides encoding at least one member of a two-component signaling system (HKs, RRs or hybrid HK/RR proteins) from Eucalyptus grandis (SEQ ID NO: 42, 48-52, 55-58, 67, 464-471, 474-478, 850-857, 948-952, 960-967 and 971-974) and Pinus radiata (SEQ ID NO: 43-47, 53, 54, 59-66, 472, 473, 479-481, 849 and 968-970). Complements of such isolated polynucleotides, reverse complements of such isolated polynucleotides and reverse sequences of such isolated polynucleotides are also provided, together with variants of such sequences, as defined below.

[0055] In another embodiment, the present invention provides isolated polypeptides encoded by the inventive polynucleotides. The amino acid sequences encoded by the DNA sequences of SEQ ID NO: 1-59, 63, 64, 66, 67, 131-481, 833-848, 851, 853-888, 946-952, 960-966 and 968-974 are provided in SEQ ID NO: 68-130, 482-832, 889-945, 953-959, 976-982 and 983-989, respectively. The amino acid sequence of SEQ ID NO: 975 is an extended sequence of SEQ ID NO: 910. Thus, in certain embodiments, the inventive polypeptides comprise amino acid sequences selected from the group consisting of 68-130, 482-832, 889-945, 953-959 and 975-989, and variants thereof.

[0056] The locations of open reading frames (ORFs) within the inventive polynucleotide sequences, together with the SEQ ID NO: for the corresponding amino acid sequence, are identified below in Table 1. TABLE 1 SEQ ID NO: SEQ ID NO: Polynucleotide ORF region Polypeptide 1 134-2134 68 2  7-2079 69 3 465-2432 70 4 237-2129 71 5  11-2743 72 6  98-2941 73 7 164-2080 74 8 139-2118 75 9 163-3069 76 10  13-1911 77 11 144-2855 78 12  58-2193 79 13  57-2180 80 47 880-3168 114 262  52-1629 613 456 1141-3102  807 457  73-2055 808 458  49-3477 809 459 383-2536 810 460 203-2104 811 461 108-1979 812 462  80-1918 813 463 224-3196 814 833 223-3213 889 834  57-2474 890 835  64-2097 891 836  70-1101 892 837  19-1572 893 839  2-2998 895 840  26-2620 896 841  59-2149 897 844  62-3562 900 848  72-1871 904 854 74-541 907 855 254-628  908 856 314-3370 909 858 183-3011 911 859  11-2413 912 860  83-1708 913 861  5-3001 914 862  5-2488 915 863 261-2162 916 864  35-1921 917 865 249-2306 918 868  43-3204 921 869 143-2620 922 870 111-2582 923 871  61-2208 924 872  38-1873 925 873  83-2479 926 874 154-2043 927 878 145-1515 931 879 198-2105 932 880 354-3344 933 881  70-2556 934 946 143-2143 953 947  60-3122 954 949  61-3195 956 950 374-3382 957 857  1-777 975 960  96-3818 976 961 553-3546 977 962 553-1995 978 963 427-4239 979 964 359-1033 980 965 149-685  981 966  2-1237 982 968 144-599  983 969 95-550  984 970  2-244 985 971 203-697  986 972  36-485 987 973  38-433 988 974 520-792 989

[0057] The polynucleotides disclosed herein were derived from forestry plant sources, namely Eucalyptus grandis and Pinus radiata. Some of the polynucleotides of the present invention are “partial ” sequences, in that they do not represent a full-length gene encoding a full-length polypeptide. The term “polypeptide encoded by a polynucleotide” as used herein, includes polypeptides encoded by a nucleotide sequence that includes partial isolated DNA sequences of the present invention. Such partial sequences may be extended by analyzing and sequencing various DNA libraries using primers and/or probes and well known hybridization and/or PCR techniques. Partial sequences may be extended until an open reading frame encoding a polypeptide, a full-length polynucleotide and/or gene capable of expressing a polypeptide, or another useful portion of the genome is identified. Such extended sequences, including full-length polynucleotides and genes, are described as “corresponding to” a sequence disclosed herein, or a variant thereof, or a portion of one of the sequences disclosed herein, or a variant thereof, when the extended polynucleotide comprises an identified sequence or its variant, or an identified contiguous portion (x-mer) of one of the sequences disclosed herein, or a variant thereof. Similarly, RNA sequences, reverse sequences, complementary sequences, antisense sequences, and the like, corresponding to the polynucleotides of the present invention, may be routinely ascertained and obtained using the cDNA sequences identified as SEQ ID NOS: 1-67, 131-481, 833-888, 946-952 and 960-974.

[0058] As discussed above, the polynucleotides disclosed herein may contain open reading frames (ORFs) or partial ORFs encoding polypeptides. AdditionallyORFs encoding polypeptides may be identified in extended or full-length sequences corresponding to the polynucleotide sequences disclosed herein. ORFs may be identified using techniques that are well known in the art. These techniques include, for example, analysis for the location of known start and stop codons, most likely reading frame identification based on codon frequencies, etc. Suitable tools and software for ORF analysis are available, for example, on the Internet at the National Institutes of Health NCBI website. ORFs and portions of ORFs may be identified in the polynucleotides of the present invention. Once a partial open reading frame is identified, the polynucleotide may be extended in the area of the partial open reading frame using techniques that are well known in the art until the polynucleotide for the full open reading frame is identified. Thus, ORFs encoding polypeptides may be identified using the polynucleotides of the present invention.

[0059] Once ORFs are identified in the polynucleotides of the present invention, the ORFs may be isolated and/or synthesized. Expressible genetic constructs comprising the ORFs and suitable promoters, initiators, terminators, etc., which are well known in the art, may then be constructed. Such genetic constructs may be introduced into a host cell to express the polypeptide encoded by the open reading frame. Suitable host cells may include various prokaryotic and eukaryotic cells, including plant cells, mammalian cells, bacterial cells, algae and the like.

[0060] Polypeptides encoded by the polynucleotides of the present invention may be expressed and used in various assays to determine their biological activity. Such polypeptides may be used to raise antibodies, to isolate corresponding interacting proteins or other compounds, and to quantitatively determine levels of interacting proteins or other compounds.

[0061] The term “polynucleotide(s),” as used herein, means a single or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases and includes DNA and corresponding RNA molecules, including hnRNA and mRNA molecules, both sense and anti-sense strands, and comprehends cDNA, genomic DNA and recombinant DNA, as well as wholly or partially synthesized polynucleotides. An hnRNA molecule contains introns and corresponds to a DNA molecule in a generally one-to-one manner. An mRNA molecule corresponds to an hnRNA and DNA molecule from which the introns have been excised. A polynucleotide may consist of an entire gene, or any portion thereof. Operable anti-sense polynucleotides may comprise a fragment of the corresponding polynucleotide, and the definition of “polynucleotide” therefore includes all such operable anti-sense fragments. Anti-sense polynucleotides and techniques involving anti-sense polynucleotides are well known in the art and are described, for example, in Robinson-Benion et al., Methods in Enzymol. 254: 363-375, 1995; and Kawasaki et al., Artific. Organs 20: 836-848, 1996.

[0062] The term “polypeptide”, as used herein, encompasses amino acid chains of any length including full length proteins, wherein amino acid residues are linked by covalent peptide bonds. Polypeptides of the present invention may be purified natural products, or may be produced partially or wholly using recombinant or synthetic techniques. Polypeptides of the present invention may be purified natural products, or may be produced partially or wholly using recombinant techniques. Polypeptides described herein may be produced using recombinant DNA methodologies, specifically, by inserting a DNA sequence that encodes the polypeptide into an expression vector and producing the polypeptide in an appropriate host. Any of a variety of expression vectors known to those of ordinary skill in the art may be employed. Expression may be achieved in any appropriate host cell that has been transformed or transfected with an expression vector containing a polynucleotide that encodes a recombinant polypeptide. The DNA sequences expressed in this manner may encode naturally occurring polypeptides, portions of naturally occurring polypeptides, or other variants thereof.

[0063] In general, the polynucleotides and polypeptides disclosed herein are prepared in an isolated, substantially pure, form. Preferably, the polynucleotides and polypeptides are at least about 80% pure, more preferably at least about 90% pure, and most preferably at least 99% pure.

[0064] The definition of the terms “complement”, “reverse complement” and “reverse sequence”, as used herein, is best illustrated by the following example. For the sequence 5′ AGGACC 3′, the complement, reverse complement and reverse sequence are as follows: complement 3′ TCCTGG 5′ reverse complement 3′ GGTCCT 5′ reverse sequence 5′ CCAGGA 3′.

[0065] As used herein, the term “variant” comprehends nucleotide or amino acid sequences different from the specifically identified sequences, wherein one or more nucleotides or amino acid residues is deleted, substituted, or added. Variants may be naturally occurring allelic variants, or non-naturally occurring variants. In certain embodiments, variants of the inventive sequences retain certain, or all, of the functional characteristics of the inventive sequence. In preferred embodiments, variants of the inventive polynucleotides encode polypeptides that are involved in a cell signaling pathway. Variant sequences (polynucleotide or polypeptide) preferably exhibit at least 50%, more preferably at least 75%, more preferably yet at least 90%, and most preferably 95% identity to a sequence of the present invention. The percentage identity is determined by aligning the two sequences to be compared as described below, determining the number of identical residues in the aligned portion, dividing that number by the total number of residues in the inventive (queried) sequence, and multiplying the result by 100.

[0066] Polynucleotide or polypeptide sequences may be aligned, and percentages of identical nucleotides in a specified region may be determined against another polynucleotide or polypeptide, using computer algorithms that are publicly available. Two exemplary algorithms for aligning and identifying the similarity of polynucleotide sequences are the BLASTN and FASTA algorithms. The alignment and similarity of polypeptide sequences may be examined using the BLASTP algorithm. BLASTX and FASTX algorithms compare nucleotide query sequences translated in all reading frames against polypeptide sequences. The BLASTN, BLASTP and BLASTX algorithms are available on the NCBI anonymous FTP server and are available from the National Center for Biotechnology Information (NCBI), National Library of Medicine, Building 38A, Room 8N805, Bethesda, Md. 20894 U.S.A.

[0067] The FASTA and FASTX algorithms are available on the Internet. The FASTA software package is also available from the University of Virginia by contacting the Assistant Provost for Research, University of Virginia, PO Box 9025, Charlottesville, Va. 22906-9025. The FASTA algorithm, set to the default parameters described in the documentation and distributed with the algorithm, may be used in the determination of polynucleotide variants. The readme files for FASTA and FASTX v1.0 x that are distributed with the algorithms describe the use of the algorithms and describe the default parameters. The use of the FASTA and FASTX algorithms is also described in Pearson, and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85:2444-2448, 1988; and Pearson, Methods in Enzymol. 183:63-98, 1990.

[0068] The BLASTN algorithm version 2.0.4 [Feb. 24-1998 ], 2.0.6 [Sep. 16-1998] and 2.0.11 [Jan. 20-2000], set to the default parameters described in the documentation and distributed with the algorithm, is preferred for use in the determination of polynucleotide variants according to the present invention. The BLASTP algorithm version 2.0.4, 2.0.6 and 25 2.0.11, set to the default parameters described in the documentation and distributed with the algorithm, is preferred for use in the determination of polypeptide variants according to the present invention. The use of the BLAST family of algorithms, including BLASTN, BLASTP and BLASTX is described in the publication of Altschul et al., Nucleic Acids Res. 25:3389-3402, 1997.

[0069] The following running parameters are preferred for determination of alignments and similarities using BLASTN that contribute to the E values and percentage identity for polynucleotides: Unix running command with default parameters thus: blastall -p blastn -d embldb -e 10 -G 0 -E 0 -r 1 -v 30 -b 30 -i queryseq -o results; and parameters are: -p Program Name [String]; -d Database [String]; -e Expectation value (E) [Real]; -G Cost to open a gap (zero invokes default behavior) [Integer]; -E Cost to extend a gap (zero invokes default behavior) [Integer]; -r Reward for a nucleotide match (blastn only) [Integer]; -v Number of one-line descriptions (V) [Integer]; -b Number of alignments to show (B) [Integer]; -i Query File [File In]; -o BLAST report Output File [File Out] Optional. The following running parameters are preferred for determination of alignments and similarities using BLASTP that contribute to the E values and percentage identity for polypeptides: blastall -p blastp -d swissprotdb -e 10 -G 0 -E 0 -v 30 -b 30 -i queryseq -o results; and the parameters are: -p Program Name [String]; -d Database [String]; -e Expectation value (E) [Real]; -G Cost to open a gap (zero invokes default behavior) [Integer]; -E Cost to extend a gap (zero invokes default behavior) [Integer]; -v Number of one-line descriptions (v) [Integer]; -b Number of alignments to show (b) [Integer]; -I Query File [File In]; -o BLAST report Output File [File Out] Optional.

[0070] The “hits” to one or more database sequences by a queried sequence produced by BLASTN, BLASTP, FASTA, or a similar algorithm, align and identify similar portions of sequences. The hits are arranged in order of the degree of similarity and the length of sequence overlap. Hits to a database sequence generally represent an overlap over only a fraction of the sequence length of the queried sequence.

[0071] As noted above, the percentage identity of a polynucleotide or polypeptide sequence is determined by aligning polynucleotide and polypeptide sequences using appropriate algorithms, such as BLASTN or BLASTP, respectively, set to default parameters; identifying the number of identical nucleic or amino acids over the aligned portions; dividing the number of identical nucleic or amino acids by the total number of nucleic or amino acids of the polynucleotide or polypeptide of the present invention; and then multiplying by 100 to determine the percentage identity. By way of example, a queried polynucleotide having 220 nucleic acids has a hit to a polynucleotide sequence in the EMBL database having 520 nucleic acids over a stretch of 23 nucleotides in the alignment produced by the BLASTN algorithm using the default parameters. The 23 nucleotide hit includes 21 identical nucleotides, one gap and one different nucleotide. The percentage identity of the queried polynucleotide to the hit in the EMBL database is thus 21/220 times 100, or 9.5%. The identity of polypeptide sequences may be determined in a similar fashion.

[0072] The BLASTN and BLASTX algorithms also produce “Expect” values for polynucleotide and polypeptide alignments. The Expect value (E) indicates the number of hits one can “expect” to see over a certain number of contiguous sequences by chance when searching a database of a certain size. The Expect value is used as a significance threshold for determining whether the hit to a database indicates true similarity. For example, an E value of 0.1 assigned to a polynucleotide hit is interpreted as meaning that in a database of the size of the EMBL database, one might expect to see 0.1 matches over the aligned portion of the sequence with a similar score simply by chance. By this criterion, the aligned and matched portions of the sequences then have a probability of 90% of being the same. For sequences having an E value of 0.01 or less over aligned and matched portions, the probability of finding a match by chance in the EMBL database is 1% or less using the BLASTN algorithm. E values for polypeptide sequences may be determined in a similar fashion using various polypeptide databases, such as the SwissProt database.

[0073] According to one embodiment, “variant” polynucleotides and polypeptides, with reference to each of the polynucleotides and polypeptides disclosed herein, preferably comprise sequences having the same number or fewer nucleic or amino acids than each of the polynucleotides or polypeptides of the present invention and producing an E value of 0.01 or less when compared to the polynucleotide or polypeptide of the present invention. That is, a variant polynucleotide or polypeptide is any sequence that has at least a 99% probability of being the same as the polynucleotide or polypeptide of the present invention, measured as having an E value of 0.01 or less using the BLASTN or BLASTX algorithms set at the default parameters. According to a preferred embodiment, a variant polynucleotide is a sequence having the same number or fewer nucleic acids than a polynucleotide of the present invention that has at least a 99% probability of being the same as the polynucleotide of the present invention, measured as having an E value of 0.01 or less using the BLASTN algorithm set at the default parameters. Similarly, according to a preferred embodiment, a variant polypeptide is a sequence having the same number or fewer amino acids than a polypeptide of the present invention that has at least a 99% probability of being the same as the polypeptide of the present invention, measured as having an E value of 0.01 or less using the BLASTP algorithm set at the default parameters.

[0074] Variant polynucleotide sequences will generally hybridize to the recited polynucleotide sequences under stringent conditions. As used herein, “stringent conditions” refers to prewashing in a solution of 6× SSC, 0.2% SDS; hybridizing at 65° C., 6× SSC, 0.2% SDS overnight; followed by two washes of 30 minutes each in 1× SSC, 0.1% SDS at 65° C. and two washes of 30 minutes each in 0.2× SSC, 0.1% SDS at 65° C.

[0075] The present invention also encompasses polynucleotides that differ from the disclosed sequences but that, as a consequence of the degeneracy of the genetic code, encode a polypeptide which is the same as that encoded by a polynucleotide disclosed herein. Additionally, polynucleotides comprising sequences that differ from the polynucleotide sequences disclosed herein as a result of deletions and/or insertions totaling less than 10% of the total sequence length are also contemplated by and encompassed within the present invention. Similarly, polypeptides comprising sequences that differ from the polypeptide sequences disclosed herein, as a result of amino acid substitutions, insertions, and/or deletions totaling less than 10% of the total sequence length are contemplated by and encompassed within the present invention.

[0076] As used herein, the term “x-mer,” with reference to a specific value of “x,” refers to a polynucleotide comprising at least a specified number (“x”) of contiguous residues of any of the polynucleotides identified as SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974. The value of x may be from about 20 to about 600, depending upon the specific sequence.

[0077] Polynucleotides of the present invention comprehend polynucleotides comprising at least a specified number of contiguous residues (x-mers) of any of the polynucleotides identified as SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974 or their variants. According to preferred embodiments, the value of x is preferably at least 20, more preferably at least 40, more preferably yet at least 60, and most preferably at least 80. Thus, polynucleotides of the present invention include polynucleotides comprising a 20-mer, a 40-mer, a 60-mer, an 80-mer, a 100-mer, a 120-mer, a 150-mer, a 180-mer, a 220-mer a 250-mer, a 300-mer, a 4 00-mer, a 500-mer or a 600-mer of a polynucleotide identified as SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974, or of a variant of one of the polynucleotides identified as SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974.

[0078] The inventive polynucleotides may be isolated by high throughput sequencing of CDNA libraries prepared from Eucalyptus grandis and Pinus radiata as described below in Examples 1 and 2. Alternatively, oligonucleotide probes based on the sequences provided in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974 can be synthesized and used to identify positive clones in either CDNA or genomic DNA libraries from Eucalyptus grandis and Pinus radiata by means of hybridization or PCR techniques. Probes can be shorter than the sequences provided herein but should be at least about 10, preferably at least about 15 and most preferably at least about 20 nucleotides in length. Hybridization and PCR techniques suitable for use with such oligonucleotide probes are well known in the art, and include those taught by Sambrook et al., Molecular cloning: a laboratory manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. Positive clones may be analyzed by restriction enzyme digestion, DNA sequencing or the like.

[0079] In addition,, the DNA sequences of the present invention may be generated by synthetic means using techniques well known in the art. Equipment for automated synthesis of oligonucleotides is commercially available from suppliers such as Perkin Elmer/Applied Biosystems Division (Foster City, Calif.) and may be operated according to the manufacturer's instructions.

[0080] In one embodiment, the genetic constructs of the present invention include an open reading frame coding for at least a functional portion of a polypeptide encoded by a polynucleotide of the present invention or a variant thereof. As used herein, the “functional portion” of a polypeptide is that portion which contains the binding site and/or the catalytic signal transduction site of the polypeptide. Examples of functional portions of the inventive polypeptides include the domains identified in FIGS. 1-23. The functional portion can be determined by targeted mutagenesis and screening of modified polypeptide products with protocols well known in the art. Normally, the functional portion is 10-20 amino acids in length, but can be shorter or longer. The active site may be made up of separate portions present on one or more polypeptide chains and will generally exhibit high substrate specificity.

[0081] The open reading frame is inserted in the genetic construct in a sense or antisense orientation, such that transformation of a target plant with the genetic construct will lead to a change in the amount of polypeptide compared to the wild-type plant. Transformation with a genetic construct comprising an open reading frame in a sense orientation will generally result in over-expression of the selected gene, while transformation with a genetic construct comprising an open reading frame in an antisense orientation will generally result in reduced expression of the selected gene. A population of plants transformed with a genetic construct comprising an open reading frame of the present invention in either a sense or antisense orientation may be screened for increased or reduced expression of the gene in question using techniques well known to those of skill in the art, and plants having the desired phenotypes may thus be isolated.

[0082] Alternatively, expression of a gene involved in plant cell signaling may be inhibited by inserting a portion of an open reading frame of the present invention, in either sense or antisense orientation, in the genetic construct. Such portions need not be full-length but preferably comprise at least 25 and more preferably at least 50 residues of an inventive polynucleotide. However, a longer portion or even the full-length DNA corresponding to the complete open reading frame may be employed. The portion of the open reading frame does not need to be precisely the same as the endogenous sequence, provided that there is sufficient sequence similarity to achieve inhibition of the target gene. Thus a sequence derived from one species may be used to inhibit expression of a gene in a different species.

[0083] In a second embodiment, the inventive genetic constructs comprise a DNA sequence including an non-coding region of a gene coding for a polypeptide of the present invention, or a DNA sequence complementary to such an non-coding region. Examples of non-coding regions which may be usefully employed in such constructs include introns and 5′-untranslated leader sequences. Transformation of a target plant with such a genetic construct may lead to a reduction in the amount of the polypeptide expressed in the plant by the process of cosuppression, in a manner similar to that discussed, for example, by Napoli et al., Plant Cell 2:279-290, 1990, and de Carvalho Niebel et al., Plant Cell 7:347-358, 1995.

[0084] Alternatively, regulation of polypeptide expression can be achieved by inserting appropriate sequences or subsequences (e.g. DNA or RNA) in ribozyme constructs (McIntyre and Manners, Transgenic Res. 5(4): 257-262, 1996). Ribozymes are synthetic RNA molecules that comprise a hybridizing region complementary to two regions. Preferably, each region comprises at least 5 contiguous nucleotides of a mRNA molecule encoded by one of the inventive polynucleotides. Ribozymes possess highly specific endonuclease activity, which autocatalytically cleaves the mRNA.

[0085] The genetic constructs of the present invention further comprise a gene promoter sequence and a gene termination sequence, operably linked to the DNA sequence to be transcribed, which control expression of the gene. The gene promoter sequence is generally positioned at the 5′ end of the DNA sequence to be transcribed, and is employed to initiate transcription of the DNA sequence. Gene promoter sequences are generally found in the 5′ non-coding region of a gene but they may exist downstream of the open reading frame, in introns (Luehrsen, Mol. Gen. Genet. 225:81-93, 1991) or in the coding region, as for example in a plant defense gene (Douglas et al., EMBO J. 10: 1767-1775, 1991).

[0086] A variety of gene promoter sequences which may be usefully employed in the genetic constructs of the present invention are well known in the art. The gene promoter sequence, and also the gene termination sequence, may be endogenous to the target plant host or may be exogenous, provided the promoter is functional in the target host. For example, the promoter and termination sequences may be from other plant species, plant viruses, bacterial plasmids and the like. Preferably, gene promoter and termination sequences are from the inventive sequences themselves.

[0087] Factors influencing the choice of promoter include the desired tissue specificity of the construct, and the timing of transcription and translation. For example, constitutive promoters, such as the Cauliflower Mosaic Virus 35S (CaMV 35S) promoter, will affect the activity of the enzyme in all parts of the plant. Use of a tissue specific promoter will result in production of the desired sense or antisense RNA only in the tissue of interest. With genetic constructs employing inducible gene promoter sequences, the rate of RNA polymerase binding and initiation can be modulated by external stimuli, such as light, heat, anaerobic stress, alteration in nutrient conditions and the like. Temporally regulated promoters can be employed to effect modulation of the rate of RNA polymerase binding and initiation at a specific time during development of a transformed cell. Preferably, the original promoters from the gene in question, or promoters from a specific tissue-targeted gene in the organism to be transformed, such as eucalyptus or pine, are used. Other examples of gene promoters which may be usefully employed in the present invention include mannopine synthase (mas), octopine synthase (ocs) and those reviewed by Chua et al., Science 244:174-181, 1989.

[0088] The gene termination sequence, which is located 3′ to the DNA sequence to be transcribed, may come from the same gene as the gene promoter sequence or may be from a different gene. Many gene termination sequences known in the art may be usefully employed in the present invention, such as the 3′ end of the Agrobacterium tumefaciens nopaline synthase gene. In one embodiment, terminator sequences are those from the original enzyme gene or from the target species to be transformed.

[0089] The genetic constructs of the present invention may also contain a selection marker that is effective in plant cells, to allow for the detection of transformed cells containing the inventive construct. Such markers, which are well known in the art, typically confer resistance to one or more toxins. One example of such a marker is the NPTII gene whose expression results in resistance to kanamycin or hygromycin, antibiotics which are usually toxic to plant cells at a moderate concentration (Rogers et al., in Methods for Plant Molecular Biology, A. Weissbach and H. Weissbach, eds., Academic Press Inc., San Diego, Calif., 1988). Transformed cells can thus be identified by their ability to grow in media containing the antibiotic in question. Alternatively, the presence of the desired construct in transformed cells can be determined by means of other techniques well known in the art, such as Southern and Western blots.

[0090] Techniques for operatively linking the components of the inventive genetic constructs are well known in the art and include the use of synthetic linkers containing one or more restriction endonuclease sites as described, for example, by Sambrook et al., Molecular cloning: a laboratory manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. The genetic construct of the present invention may be linked to a vector having at least one replication system, for example Escherichia coli, whereby after each manipulation, the resulting construct can be cloned and sequenced and the correctness of the manipulation determined.

[0091] The genetic constructs of the present invention may be used to transform a variety of target organisms, including plants, both monocotyledonous angiosperms (e.g. grasses, corn, grains, oat, wheat and barley), dicotyledonous angiosperms (e.g. Arabidopsis, tobacco, legumes, alfalfa, oaks, eucalyptus, maple), and gymnosperms (e.g. Scots pine (Aronen, Finnish Forest Res. Papers, vol. 595, 1996), white spruce (Ellis et al., Biotechnology 11: 84-89, 1993), and larch (Huang et al., In Vitro Cell 27:201-207, 1991). In a preferred embodiment, the inventive genetic constructs are employed to transform woody plants, herein defined as a perennial tree or shrub whose stem increases in diameter each year by the addition of woody tissue. Preferably the target plant is selected from the group consisting of eucalyptus and pine species, most preferably from the group consisting of Eucalyptus grandis and Pinus radiata. Other species which may be usefully transformed with the genetic constructs of the present invention include, but are not limited to: pines such as Pinus banksiana, Pinus brutia, Pinus caribaea, Pinus clausa, Pinus contorta, Pinus coulteri, Pinus echinata, Pinus eldiarica, Pinus ellioti, Pinus jeffreyi, Pinus lambertiana, Pinus monticola, Pinus nigra, Pinus palustrus, Pinus pinaster, Pinus ponderosa, Pinus resinosa, Pinus rigida, Pinus serotina, Pinus strobus, Pinus sylvestris, Pinus taeda, Pinus virginiana; other gymnosperms, such as Abies amabilis, Abies balsamea, Abies concolor, Abies grandis, Abies lasiocarpa, Abies magnifica, Abies procera, Chamaecyparis lawsoniona, Chamaecyparis nootkatensis, Chamaecyparis thyoides, Juniperus virginiana, Larix decidua, Larix laricina, Larix leptolepis, Larix occidentalis, Larix siberica, Libocedrus decurrens, Picea abies, Picea engelmanni, Picea glauca, Picea mariana, Picea pungens, Picea rubens, Picea sitchensis, Pseudotsuga menziesii, Sequoia gigantea, Sequoia sempervirens, Taxodium distichum, Tsuga canadensis, Tsuga heterophylla, Tsuga mertensiana, Thuja occidentalis, Thuja plicata; Eucalypts, such as Eucalyptus alba, Eucalyptus bancroftii, Eucalyptus botyroides, Eucalyptus bridgesiana, Eucalyptus calophylla, Eucalyptus camaldulensis, Eucalyptus citriodora, Eucalyptus cladocalyx, Eucalyptus coccifera, Eucalyptus curtisii, Eucalyptus dalrympleana, Eucalyptus deglupta, Eucalyptus delagatensis, Eucalyptus diversicolor, Eucalyptus dunnii, Eucalyptus ficifolia, Eucalyptus globulus, Eucalyptus gomphocephala, Eucalyptus gunnii, Eucalyptus henryi, Eucalyptus laevopinea, Eucalyptus macarthurii, Eucalyptus macrorhyncha, Eucalyptus maculata, Eucalyptus marginata, Eucalyptus megacarpa, Eucalyptus melliodora, Eucalyptus nicholii, Eucalyptus nitens, Eucalyptus nova-anglica, Eucalyptus obliqua, Eucalyptus obtusiflora, Eucalyptus oreades, Eucalyptus pauciflora, Eucalyptus polybractea, Eucalyptus regnans, Eucalyptus resinifera, Eucalyptus robusta, Eucalyptus rudis, Eucalyptus saligna, Eucalyptus sideroxylon, Eucalyptus stuartiana, Eucalyptus tereticornis, Eucalyptus torelliana, Eucalyptus urnigera, Eucalyptus urophylla, Eucalyptus viminalis, Eucalyptus viridis, Eucalyptus wandoo and Eucalyptus youmanni; and hybrids between any of the above species.

[0092] Techniques for stably incorporating genetic constructs into the genome of target plants are well known in the art and include Agrobacterium tumefaciens-mediated introduction, electroporation, protoplast fusion, injection into reproductive organs, injection into immature embryos, high velocity projectile introduction and the like. The choice of technique will depend upon the target plant to be transformed. For example, dicotyledonous plants and certain monocots and gymnosperms may be transformed by Agrobacterium Ti plasmid technology, as described, for example by Bevan, Nucleic Acids Res. 12:8711-8721, 1984. Targets for the introduction of the genetic constructs of the present invention include tissues, such as leaf tissue, dissociated cells, protoplasts, seeds, embryos, meristematic regions; cotyledons, hypocotyls, and the like. One method for transforming eucalyptus and pine is a biolistic method using pollen (see, for example, Aronen, Finnish Forest Res. Papers vol. 595, 53pp, 1996) or easily regenerable embryonic tissues.

[0093] Once the cells are transformed, cells having the inventive genetic construct incorporated in their genome may be selected by means of a marker, such as the kanamycin resistance marker discussed above. Transgenic cells may then be cultured in an appropriate medium to regenerate whole plants, using techniques well known in the art. In the case of protoplasts, the cell wall is allowed to reform under appropriate osmotic conditions. In the case of seeds or embryos, an appropriate germination or callus initiation medium is employed. For explants, an appropriate regeneration medium is used. Regeneration of plants is well established for many species. For a review of regeneration of forest trees see Dunstan et al., Somatic embryogenesis in woody plants. In: Thorpe, T. A. ed., In Vitro Embryogenesis of Plants. Vol. 20 in Current Plant Science and Biotechnology in Agriculture, Chapter 12, pp. 471-540, 1995. Specific protocols for the regeneration of spruce are discussed by Roberts et al., Somatic Embryogenesis of Spruce. In: Synseed. Applications of synthetic seed to crop improvement. Redenbaugh, K, ed., CRC Press, Chapter 23, pp. 427-449, 1993. The resulting transformed plants may be reproduced sexually or asexually, using methods well known in the art, to give successive generations of transgenic plants.

[0094] As discussed above, the production of RNA in target plant cells can be controlled by choice of the promoter sequence. A target plant may be transformed with more than one genetic construct of the present invention, thereby modulating the activity of more than one polypeptide, affecting polypeptide activity in more than one tissue, or affecting polypeptide activity at more than one expression time. Similarly, a genetic construct may be assembled containing more than one open reading frame coding for an inventive polypeptide or more than one non-coding region of a gene coding for such a polypeptide. The polynucleotides of the present invention may also be employed in combination with other known sequences encoding polypeptides involved in plant cell signaling.

[0095] The isolated polynucleotides of the present invention may be employed as probes to isolate DNA sequences encoding polypeptides involved in cell signaling from other plant species, using techniques well known to those of skill in the art, such as routinely used DNA hybridization and PCR techniques.

[0096] The inventive polynucleotides, polypeptides and antibodies to such polypeptides may be used to screen for molecules that interact with such polynucleotides and/or polypeptides and that thereby modulate cell signaling. Techniques for performing such assays are well known in the art. Similarly, the polynucleotides and polypeptides of the present invention may be employed in studies designed to elucidate the mechanism of cell signaling pathways.

[0097] The following examples are offered by way of illustration and not by way of limitation.

EXAMPLE 1 Isolation and Characterization of cDNA Clones from Eucalyptus grandis

[0098]Eucalyptus grandis cDNA expression libraries were constructed and screened as follows.

[0099] mRNA was extracted from specific plant tissues, such as trunk xylem, using the protocol of Chang et al., Plant Molecular Biology Reporter 11:113-116, 1993, with minor modifications. Specifically, samples were dissolved in CPC-RNAXB (100 mM Tris-Cl, pH 8,0; 25 mM EDTA; 2.0 M NaCl; 2% CTAB; 2% PVP and 0.05% Spermidine*3HCl) and extracted with chloroform:isoamyl alcohol, 24:1. mRNA was precipitated with ethanol and the total RNA preparation was purified using a Poly(A) Quik mRNA Isolation Kit (Stratagene, La Jolla, Calif.). A cDNA expression library was constructed from the purified mRNA by reverse transcriptase synthesis followed by insertion of the resulting cDNA clones in Lambda ZAP using a ZAP Express cDNA Synthesis Kit (Stratagene), according to the manufacturer's protocol. The resulting cDNAs were packaged using a Gigapack II Packaging Extract (Stratagene) employing 1 μl of sample DNA from the 5 μl ligation mix. Mass excision of the library was done using XL1-Blue MRF′ cells and XLOLR cells (Stratagene) with ExAssist helper phage (Stratagene). The excised phagemids were diluted with NZY broth (Gibco BRL, Gaithersburg, MD) and plated out onto LB-kanamycin agar plates containing 5-bromo-4-chloro-3-indolyl-β-D-galactoside (X-gal) and isopropylthio-beta-galactoside (IPTG).

[0100] Of the colonies plated and picked for DNA miniprep, 99% contained an insert suitable for sequencing. Positive colonies were cultured in NZY broth with kanamycin and cDNA was purified by means of alkaline lysis and polyethylene glycol (PEG) precipitation. Agarose gel at 1% was used to screen sequencing templates for chromosomal contamination. Dye primer sequences were prepared using a Turbo Catalyst 800 machine (Perkin Elmer/Applied Biosystems Division, Foster City, Calif.) according to the manufacturer's protocol.

[0101] DNA sequence for positive clones was obtained using a Perkin Elmer/Applied Biosystems Division Prism 377 sequencer. cDNA clones were sequenced first from the 5′ end and, in some cases, also from the 3′ end. For some clones, internal sequence was obtained by designing primers to hybridize to the ends of known sequences, and using these as sequencing primers extending the amount of sequence information. This procedure was repeated iteratively until the complete sequence was obtained. Alternatively, internal sequence was obtained by generating “nested” deletion clones of the gene of interest using published methods (Henikoff, Gene 28:351-359, 1984).

[0102] The determined cDNA sequence was compared to known sequences in the EMBL database (Release 58, Mar. 1999) using the computer algorithms FASTA and/or BLASTN. Multiple alignments of redundant sequences were used to build up reliable consensus sequences. Based on similarity to known sequences from other plant species, the isolated DNA sequences were identified as encoding RLKs (SEQ ID NO: 2, 8, 9, 11, 15, 18, 19, 21-25, 33, 34, 38, 131-301, 448-463, 848, 858-874, 882-887, 946 and 947) or at least one member of a two-component signaling system (HKs, RRs or hybrid HK/RR proteins; SEQ ID NO: 42, 48-52, 55-58, 67, 464-471, 474-478, 850-857, 948-952 and 960-974). The sequences of SEQ ID NO: 2, 8, 9, 11, 15, 18, 19, 21-25, 33, 34, 38, 946 and 947 were found to have less than 10% identical residues (determined as described above) to known sequences. In addition, the 5′ UTR regions of SEQ ID NO: 856, 961 and 963 are each believed to contain an internal ribosome entry site (IRES). IRESs may be employed to control gene expression without the use of specific promoters (Sachs, Cell, 101:243-245, 2000). SEQ ID NO: 848, 854, 855, 856, 859, 860, 862, 863, 864, 865, 866, 867, 868, 869, 871, 872, 873, 874, 882, 883, 885, 946, 947, 948, 949, 950, 951, 960, 961, 963 and 967 represent extended sequences of SEQ ID NO: 232, 467, 468, 48, 282, 288, 488, 453, 289, 268, 297, 278, 290., 449, 299, 301, 270, 269, 276, 454, 300, 450, 280, 851, 49, 470, 57, 50, 948, 951 and 854, respectively. SEQ ID NO: 962 is a splice variant of SEQ ID NO: 961. The sequences of SEQ ID NO: 848, 854-856, 859, 860, 862-865, 868, 869, 871-874, 946, 947, 949, 950, 960-964, 967 and 971-973 are believed to be full-length sequences, in that they each contain a complete open reading frame.

EXAMPLE 2 Isolation and Characterization of cDNA Clones from Pinus radiata

[0103] A Pinus radiata cDNA expression library was constructed from specific tissues, such as xylem, and screened as described above in Example 1. DNA sequence for positive clones was obtained using forward and reverse primers on a Perkin Elmer/Applied Biosystems Division Prism 377 sequencer and the determined sequences were compared to known sequences in the database as described above.

[0104] Based on similarity to known sequences from other plant species, the isolated DNA sequences were identified as encoding RLKs (SEQ ID NO: 1, 3-7, 10, 12-14, 16, 17, 20, 26-32, 35-37, 39-41, 302-447, 833-847, 875-881 and 888) or at least one member of a two-component signaling system (HKs, RRs or hybrid HK/RR proteins; SEQ ID NO: 43-47, 53, 54, 59-66, 472, 473, 479-481, 849 and 968-970). The sequences of SEQ ID NO: 3-7, 10, 12-14, 16, 17, 20, 26, 28-32, 35-37 and 39-41 were found to have less than 10% identical residues (determined as described above) to known sequences. In addition, the 5′ UTR region of SEQ ID NO: 47 is believed to contain an internal ribosome entry site (IRES). As noted above, IRESs may be employed to control gene expression without the use of specific promoters (Sachs, Cell, 101:243-245, 2000).

[0105] The sequence of SEQ ID NO: 480 was found to contain a putative unspliced intron and the translation is split into two ORFs. The predicted amino acid sequences encoded by these two ORFs are provided in SEQ ID NO: 830 and 831. SEQ ID NO: 411, 413, 317, 421, 415, 434 and 416 represent extended sequences of SEQ ID NO: 26, 17, 28, 39, 16, 30 and 41, respectively. SEQ ID NO: 833, 834, 835, 836, 837, 838, 839, 840, 841, 842, 843, 844, 875, 876, 877, 878, 879, 880 and 888 represent extended sequences of SEQ ID NO: 411, 413, 317, 29, 421, 415, 434, 416, 35, 37, 36, 40, 438, 426, 445, 418, 435, 411 and 427, respectively. The sequences of SEQ ID NO: 833-837, 839-841, 844, 878-881, 968 and 969 are believed to be full-length sequences in that they contain a complete open reading frame.

EXAMPLE 3 Use of an Ethylene Receptor Gene to Modify Plant Growth

[0106] Transformation of tobacco plants with a Pinus radiata ethylene receptor gene homolog is performed as follows. Genetic constructs comprising sense and anti-sense constructs containing a DNA sequence including the coding region of an ethylene receptor homolog (SEQ ID NO: 43) from Pinus radiata are constructed and inserted into Agrobacterium tumefaciens by direct transformation using published methods (An: Binary Vectors. In: Gelvin SB, Schilperoort RA, eds., Plant Molecular Biology Manual, Kluwer Academic Publishers, Dordrecht, 1988. The constructs of sense DNA are made by cloning PBK-CMV plasmid cDNA inserts into pART7 plasmids, followed by cloning of the NotI-digested 35S-Insert-OCS 3′ UTR-fragments from the pART7 vectors into pART27 plant expression vectors; (Gleave, Plant Mol. Biol. 20: 1203-1207, 1992). The presence and integrity of the transgenic constructs are verified by restriction digestion and DNA sequencing.

[0107] Tobacco (Nicotiana tabacum cv. Samsun) leaf sections are transformed with the sense and anti-sense ethylene receptor constructs using a method based on that of Horsch et al., Science 227:1229-1231, 1985. Transformed plants containing the appropriate construct are verified using Southern blot experiments. Expression of the Pinus ethylene receptor homolog in transformed plants is confirmed by isolating total RNA from each independent transformed plant line created with the sense and anti-sense constructs. The RNA samples are analyzed in Northern blot experiments to determine the level of expression of the transgene in each transformed line. The expression level of the ethylene receptor polypeptide, encoded by the Pinus ethylene receptor gene and by the endogenous tobacco ethylene receptor gene, for each transformed plant line created with the sense and anti-sense constructs is compared to that of wild-type control plants.

[0108] SEQ ID NOS: 1-989 are set out in the attached Sequence Listing. The codes for nucleotide sequences used in the attached Sequence Listing, including the symbol “n,” conform to WIPO Standard ST.25 (1998), Appendix 2, Table 1.

[0109] All references cited herein, including patent references and non-patent publications, are hereby incorporated by reference in their entireties.

[0110] Although the present invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be apparent to those skilled in the art that changes and modifications can be carried out without departing from the scope of the invention which is intended to be limited only by the scope of the claims. 

1. An isolated polynucleotide comprising a sequence selected from the group consisting of: (a) sequences provided in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974; (b) complements of the sequence recited in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974; (c) reverse complements of the sequence recited in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974; (d) reverse sequences of the sequences recited in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974; (e) sequences having at least 75% identity to a sequence provided in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974 as determined using the computer algorithm BLASTN; (f) sequences having at least 90% identity to a sequence provided in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974 as determined using the computer algorithm BLASTN; (g) sequences having at least 95% identity to a sequence provided in SEQ ID NO: 1-67, 131-481, 833-888, 946-952 and 960-974 as determined using the computer algorithm BLASTN; (h) nucleotide sequences that are 200-mers of a sequence recited in (a)-(d) above; (i) nucleotide sequences that are 100-mers of a sequence recited in (a)-(d) above; and (j) nucleotide sequences that are 40-mers of a sequence recited in (a)-(d) above.
 2. An isolated polypeptide encoded by a polynucleotide according to claim
 1. 3. An isolated polypeptide comprising an amino acid sequence selected from the group consisting of: (a) sequences of SEQ ID NO: 68-130, 482-832, 889-945, 953-959 and 975-989; (b) sequences having at least 75% identity to a sequence of (a); (c) sequences having at least 90% identity to a sequence of (a); and (d) sequences having at least 95%) identity to a sequence of (a).
 4. An isolated polynucleotide that encodes a polypeptide according to claim
 3. 5. A genetic construct comprising a polynucleotide according to claim
 1. 6. A transgenic cell comprising a genetic construct according to claim
 5. 7. A genetic construct comprising, in the 5′-3′ direction: (a) a gene promoter sequence, (b) an open reading frame coding for at least a functional portion of a polypeptide encoded by an isolated polynucleotide of claim 1; and (c) a gene termination sequence.
 8. The genetic construct of claim 7 wherein the open reading frame is in a sense orientation.
 9. The genetic construct of claim 7 wherein the open reading frame is in an antisense orientation.
 10. The genetic construct of claim 7 wherein the gene promoter sequence and gene termination sequences are functional in a plant host.
 11. The genetic construct of claim 7 further comprising a marker for identification of transformed cells.
 12. A genetic construct comprising, in the 5′-3′ direction: (a) a gene promoter sequence, (b) a non-coding region of an isolated polynucleotide of claim 1; and (c) a gene termination sequence.
 13. The genetic construct of claim 12 wherein the non-coding region is in a sense orientation.
 14. The genetic construct of claim 12 wherein the non-coding region is in an antisense orientation.
 15. The genetic construct of claim 12 wherein the gene promoter sequence and gene termination sequences are functional in a plant host.
 16. A transgenic plant cell comprising a genetic construct of any one of claims 7-15.
 17. A plant comprising a transgenic plant cell according to claim 16, or a part, propagule or progeny thereof.
 18. The plant of claim 17 wherein the plant is a woody plant.
 19. The plant of claim 18 wherein the plant is selected from the group consisting of eucalyptus, pine, acacia, poplar, sweetgum, teak and mahogany species.
 20. A method for modifying cell signaling in a plant comprising stably incorporating into the genome of the plant a genetic construct according to any one of claims 7-15.
 21. The method of claim 20 wherein the plant is a woody plant.
 22. The method of claim 21, wherein the plant is selected from the group consisting of eucalyptus, pine, acacia, poplar, sweetgum, teak and mahogany species.
 23. A method for producing a plant having modified cell signaling comprising: (a) transforming a plant cell with a genetic construct according to any one of claims 7-15 to provide a transgenic cell; and (b) cultivating the transgenic cell under conditions conducive to regeneration and mature plant growth.
 24. The method of claim 23 wherein the plant is a woody plant.
 25. The method of claim 24 wherein the plant is selected from the group consisting of eucalyptus, pine, acacia, poplar, sweetgum, teak and mahogany species.
 26. A method for modifying the activity of a polypeptide in a plant comprising stably incorporating into the genome of the plant a genetic construct according to any one of claims 7-15.
 27. The method of claim 26 wherein the plant is a woody plant.
 28. The method of claim 27 wherein the plant is selected from the group consisting of eucalyptus, pine, acacia, poplar, sweetgum, teak and mahogany species.
 29. An isolated oligonucleotide probe or primer comprising at least 10 contiguous residues complementary to 10 contiguous residues of a nucleotide sequence recited in claim
 1. 30. A kit comprising a plurality of oligonucleotide probes or primers of claim
 29. 31. A storage medium having recorded thereon a plurality of polynucleotides, at least one of the polynucleotides comprising a nucleotide sequence recited in claim
 1. 